Abstract

We review why the Thomas rotation is a crucial facet of special relativity, one that is just
as fundamental, and just as "unintuitive" and "paradoxical", as such traditional effects
as length contraction, time dilation, and the ambiguity of simultaneity. We show how
this phenomenon can be quite naturally introduced and investigated in the context
of a typical introductory course on special relativity, in a way that is appropriate
for, and completely accessible to, undergraduate students. We also demonstrate, in a
more advanced section aimed at the graduate student studying the Dirac equation and
relativistic quantum field theory, that careful consideration of the Thomas rotation
will become vital as modern experiments in particle physics continue to move from
unpolarized to polarized cross-sections.

I. Introduction

Recently, a number of the current authors have reviewed how aspects of relativistic quantum
mechanics can be appreciated from the point of view of relativistic classical mechanics.
In Ref. 1, the Foldy-Wouthuysen transformation was reviewed, where it was emphasized 
that many of the operators of the Dirac equation become, after transformation, completely 
recognizable from the point of view of classical physics. In Ref. 2, the Feynman-Stueckelberg 
formulation of antiparticles was reviewed, entirely within the domain of classical mechanics, 
and it was emphasized that one can make good sense of antiparticle motion without needing 
to resort to quantum mechanical arguments. 

In extending these ideas to the domain of quantum field theory, we have found that there 
is a third aspect of classical relativistic mechanics that is of crucial theoretical and practical 
importance, but which rates scarcely a mention in most textbooks on special relativity: the 
Thomas rotation. (In the case of a continuous evolution of infinitesimal rotations, this effect 
is usually referred to as the Thomas precession; but we are here mainly concerned with the 
more general case of a single, finite rotation.) Historically, the relative obscurity of this effect 
can, perhaps, be traced to the fact that the special theory of relativity was two decades old 
before Thomas made his discovery. Pais's summary of events^ is instructive:

Twenty years later [after his seminal 1905 paper on special relativity], Einstein heard
something about the Lorentz group that greatly surprised him. It happened while he
was in Leiden. In October 1925 George Eugene Uhlenbeck and Samuel Goudsmit had
discovered the spin of the electron and thereby explained the occurrence of the alkali
doublets, but for a brief period it appeared that the magnitude of the doublet splitting
did not come out correctly. Then Llewellyn Thomas supplied the missing factor, 2,
now known as the Thomas factor. Uhlenbeck told me that he did not understand a
word of Thomas's work when it first came out. 'I remember that, when I first heard
about it, it seemed unbelievable that a relativistic effect could give a factor of 2 instead
of something of order $v/c$ .... Even the cognoscenti of the relativity theory (Einstein
included!) were quite surprised.' At the heart of the Thomas precession lies the fact
that a Lorentz transformation with velocity $\mathbf{v}_1$ followed by a second one with velocity
$\mathbf{v}_2$ in a different direction does not lead to the same inertial frame as one single Lorentz
transformation with the velocity $\mathbf{v}_1 + \mathbf{v}_2$. (It took Pauli a few weeks before he grasped
Thomas's point.)

It seems remarkable (but, according to the above account, undeniable) that neither Einstein
nor Pauli came across the Thomas rotation before 1925. However, the effect we now call the
Thomas rotation was known before Thomas's paper. The early history has been traced by 
Ungar.^ 

Now, most textbooks on special relativity follow the extraordinarily clear exposition of 
the theory given by Einstein in his seminal paper. Unfortunately, this has meant that little 
or no attention has usually been given to the "Thomas effect", which has generally been 
relegated to a brief mention in textbooks on quantum mechanics and atomic structure. As 
far as we are aware, the best treatment of the Thomas precession in a textbook still in
print is arguably that contained in Jackson's book on classical electrodynamics.^ A similar 
discussion is given in Goldstein,^ who emphasizes the complexity of the general calculations. 
This complexity is inhibiting to both the writers and the readers of the textbooks. 

That the Thomas rotation, or precession, still puzzles students and their teachers can 
be discerned from the pages of this journal. In Question #57, MacKeown^ asks ". . . is said
to introduce a velocity independent constant factor. Can any simple, convincing, argument 
be given for this?" Ungar^ and Goedecke^ have countered the complexity by introducing 
new formalisms, a "weakly associative-commutative groupoid" by Ungar, and the tetrad 
formalism by Goedecke. While they offer useful insights, and emphasise the Thomas rotation, 
they are not well suited to the introductory course. Muller^ (in the Appendix) and Philpott^'^ 
(to introduce the main point of his paper) give derivations of the Thomas precession which 
are related to the present one. But we believe that the straightforward treatment below, 
and its emphasis on Thomas rotation, offers conceptual and pedagogical advantages which 
make it suitable to an introductory course. 

In this paper, we show how an instructive, elementary, and intriguing discussion of the 
Thomas rotation can be "grafted on" to any standard introductory course on special relativity.
As prerequisite we assume nothing more than the standard expression for a Lorentz
boost along the x-axis of a system of coordinates. For simplicity, we also make use of the
energy-momentum four-vector, as well as matrix multiplication, although such references
could be deleted if thought necessary, albeit at the expense of rendering the algebra a little
less transparent. (The widespread availability of calculators and computer programs capable
of matrix multiplication means that the complexities of the following calculations can be
drastically minimized by the use of matrix notation, leaving more time for the contemplation
of the physical results.) These preliminaries are covered in Sec. II.

In Sec. III, we show how these simple building blocks can be put together to create a
sequence of intriguing and completely counterintuitive "paradoxes". This material could be 
presented almost verbatim in any introductory course on special relativity. 

Sec. IV provides full solutions and explanations of these elementary Thomas rotation 
"paradoxes", and general expressions are derived for the Thomas rotation in arbitrary cases. 

In Sec. V, we provide a further "paradox" in the context of the polarization properties 
of the scattering of a Dirac particle. This example is more advanced, in that it presumes 
familiarity with at least an introductory level of quantum field theory; and thus it would 
not usually be appropriate for an introductory course on special relativity. On the other 
hand, this example is arguably much more practically important than the others, in that it 
shows that real-life calculations of scattering cross-sections can be completely erroneous if 
due regard is not taken of this subtle facet of relativistic kinematics. Sec. VI provides a full 
solution of this "paradox". 

Finally, Sec. VII summarizes our conclusions. 



II. Preliminaries 



In this section we review those features of special relativity which we would assume to 
have been taught in an introductory course before the discussion of Thomas rotation, to set 
the scene and to establish our notation. 

Throughout this paper, we shall use a "naturalized" set of units, in which $c = 1$. To
convert any expression to SI units, one need simply replace $t$ by $ct$, $v$ by $v/c$, $E$ by $E/c^2$,
and $\mathbf{p}$ by $\mathbf{p}/c$. (Boldface denotes a three-vector.) In Secs. V and VI we shall also use units
in which $\hbar = 1$.

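The Lorentz transformation from a frame $S$ with coordinates $(t, x, y, z)$, to a frame $S'$,
moving with respect to $S$ with a velocity $v$ along the x-axis, in which the coordinates are
$(t', x', y', z')$, is

$$t' = \gamma (t - v x), \qquad x' = \gamma (x - v t), \qquad y' = y, \qquad z' = z, \tag{1}$$

where

$$\gamma = \frac{1}{\sqrt{1 - v^2}}. \tag{2}$$

This transformation, written in matrix notation, is

$$\begin{pmatrix} t' \\ x' \\ y' \\ z' \end{pmatrix} =
\begin{pmatrix}
\gamma & -\gamma v & 0 & 0 \\
-\gamma v & \gamma & 0 & 0 \\
0 & 0 & 1 & 0 \\
0 & 0 & 0 & 1
\end{pmatrix}
\begin{pmatrix} t \\ x \\ y \\ z \end{pmatrix}. \tag{3}$$

We shall denote the matrix that effects this boost by velocity $v$ in the x-direction as $B_x(v)$:

$$B_x(v) =
\begin{pmatrix}
\gamma & -\gamma v & 0 & 0 \\
-\gamma v & \gamma & 0 & 0 \\
0 & 0 & 1 & 0 \\
0 & 0 & 0 & 1
\end{pmatrix}. \tag{4}$$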



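Clearly, a boost by velocity $v$ in the y-direction or in the z-direction would be effected by

$$B_y(v) =
\begin{pmatrix}
\gamma & 0 & -\gamma v & 0 \\
0 & 1 & 0 & 0 \\
-\gamma v & 0 & \gamma & 0 \\
0 & 0 & 0 & 1
\end{pmatrix},
\qquad
B_z(v) =
\begin{pmatrix}
\gamma & 0 & 0 & -\gamma v \\
0 & 1 & 0 & 0 \\
0 & 0 & 1 & 0 \\
-\gamma v & 0 & 0 & \gamma
\end{pmatrix}. \tag{5}$$

The energy-momentum four-vector,

$$p^\mu = \begin{pmatrix} E \\ p^x \\ p^y \\ p^z \end{pmatrix}, \tag{6}$$

shall play a key role in our analysis. A particle of mass $m$, at rest in a system of coordinates,
has $E = m$ and $\mathbf{p} = 0$. If we boost our system of coordinates by the velocity $-v$ in the
x-direction (so that, relative to this new system of coordinates, the particle has velocity $+v$
in the x-direction), then the application of $B_x(-v)$ yields

$$B_x(-v) \begin{pmatrix} m \\ 0 \\ 0 \\ 0 \end{pmatrix} =
\begin{pmatrix} m\gamma \\ m\gamma v \\ 0 \\ 0 \end{pmatrix}.$$

This implies that the x-velocity of the particle can be "extracted" from the components
of its four-momentum by computing the ratio $p^x/E$. Since the x-direction is arbitrary, the
immediate generalization to a particle moving with velocity $\mathbf{v}$ in any direction is

$$\mathbf{v} = \frac{\mathbf{p}}{E}. \tag{7}$$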



To obtain the law for the composition of two velocities $v_1$ and $v_2$ in the same direction, we
may simply apply $B_x(-v_1)$ followed by $B_x(-v_2)$ to a particle at rest:

$$B_x(-v_2)\, B_x(-v_1) \begin{pmatrix} m \\ 0 \\ 0 \\ 0 \end{pmatrix} =
\begin{pmatrix} m\gamma_1\gamma_2 (1 + v_1 v_2) \\ m\gamma_1\gamma_2 (v_1 + v_2) \\ 0 \\ 0 \end{pmatrix}. \tag{8}$$

(In this and subsequent equations we shall take it to be understood that the four components
of the column vector refer to the components of the four-momentum.) On using (7), Eq. (8)
yields

$$v_{12} = \frac{v_1 + v_2}{1 + v_1 v_2}, \tag{9}$$

namely, the standard result. It will be noted that the same result would have been obtained
if we had applied $B_x(-v_2)$ first and $B_x(-v_1)$ second: boosts in the same direction commute.
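The collinear composition law lends itself to a quick numerical check. The following is a minimal sketch of our own, assuming Python with NumPy; the names `boost_x`, `v1`, `v2` and the sample velocities are illustrative choices, not part of the derivation above. It applies two boosts along x to a particle at rest, extracts the velocity via Eq. (7), and compares it with the standard addition formula.

```python
import numpy as np

def gamma(v):
    """Lorentz factor for speed v, in units with c = 1."""
    return 1.0 / np.sqrt(1.0 - v * v)

def boost_x(v):
    """Matrix B_x(v) acting on (t, x, y, z) or (E, px, py, pz)."""
    g = gamma(v)
    B = np.eye(4)
    B[0, 0] = B[1, 1] = g
    B[0, 1] = B[1, 0] = -g * v
    return B

m, v1, v2 = 1.0, 0.6, 0.7                 # illustrative values
rest = np.array([m, 0.0, 0.0, 0.0])       # four-momentum of a particle at rest

p = boost_x(-v2) @ boost_x(-v1) @ rest    # two successive boosts along x
v_composed = p[1] / p[0]                  # velocity extracted as p / E
v_formula = (v1 + v2) / (1.0 + v1 * v2)   # standard addition law

# Boosts along the same axis commute.
commute = np.allclose(boost_x(-v2) @ boost_x(-v1),
                      boost_x(-v1) @ boost_x(-v2))
```

Here `v_composed` and `v_formula` should agree to rounding error, and `commute` should be `True`.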



III. Some elementary Thomas rotation "paradoxes" 

Let us now apply the results obtained in Sec. II to some hypothetical maneuvers of the 
USS Enterprise under impulse power. In the following, the system of coordinates being 
considered is that of an observer on board the bridge of the Enterprise. 

Let us assume that the Enterprise begins at rest relative to some particular star. We 
ignore gravitational effects, so that if the Enterprise were to not fire any thrusters, then it 
would remain at rest relative to the star. 

Let us now apply a boost to the Enterprise by some velocity $v_0$ in the x-direction, and
follow it by a boost by the velocity $-v_0$, again in the x-direction. We expect that the net
effect on the velocity of the Enterprise would be zero: it would move in the x-direction during
the maneuver (by what distance is of no interest to us here), but at the end of the maneuver
it would again be at rest relative to the star. We can confirm this by considering the effect
of applying $B_x(v_0)$ and then $B_x(-v_0)$ on, say, the components of the four-momentum of the
star, as observed from the Enterprise. If the star has the mass $m$, then a straightforward
calculation verifies that

$$B_x(-v_0)\, B_x(v_0) \begin{pmatrix} m \\ 0 \\ 0 \\ 0 \end{pmatrix} =
\begin{pmatrix} m \\ 0 \\ 0 \\ 0 \end{pmatrix},$$

and indeed $B_x(-v_0)\, B_x(v_0) = I$, where $I$ is the identity matrix. (In performing these
calculations, and those that follow, it is useful to replace even powers of $v$, wherever they occur,
by means of the identity

$$v^2 = 1 - \frac{1}{\gamma^2}, \tag{10}$$

which can be derived from the definition (2).) We can perform the same maneuver in the
y-direction: namely,

$$B_y(-v_0)\, B_y(v_0) \begin{pmatrix} m \\ 0 \\ 0 \\ 0 \end{pmatrix} =
\begin{pmatrix} m \\ 0 \\ 0 \\ 0 \end{pmatrix},$$

and $B_y(-v_0)\, B_y(v_0) = I$, again as expected.



Having thus verified the action of our "thrusters" in the x- and y-directions, by means
of these four boosts, let us now try another test maneuver, by mixing the order of these
boosts. Namely, let us apply $B_x(v_0)$, followed by $B_y(v_0)$, then $B_x(-v_0)$, and finally $B_y(-v_0)$.
Again, we expect that the star will be at rest, relative to the Enterprise, at the end of the
maneuver. However, if we perform the calculations, then (after some algebra) we find

$$B_y(-v_0)\, B_x(-v_0)\, B_y(v_0)\, B_x(v_0) \begin{pmatrix} m \\ 0 \\ 0 \\ 0 \end{pmatrix} =
\begin{pmatrix}
m\gamma_0 (\gamma_0^3 - 2\gamma_0^2 + 2) \\
m\gamma_0^2 (\gamma_0 - 1)\, v_0 \\
m\gamma_0 (\gamma_0 - 1)(\gamma_0^2 - \gamma_0 - 1)\, v_0 \\
0
\end{pmatrix}. \tag{11}$$



Something has gone wrong! Instead of ending up with the star at rest, we find that it is now 
"drifting". What has happened? 

One can repeat and check the algebraic calculations above as many times and in as many
ways as one wishes; but the result (11) is not a computational error. We can check its
self-consistency by noting that, for any four-momentum of a particle of mass $m$, the identity

$$E^2 - \mathbf{p}^2 = m^2$$

should be satisfied, as it indeed is for the components listed in (11).

Moreover, if one simply changes the order of the final two boosts in (11), then one finds

$$B_x(-v_0)\, B_y(-v_0)\, B_y(v_0)\, B_x(v_0) \begin{pmatrix} m \\ 0 \\ 0 \\ 0 \end{pmatrix} =
\begin{pmatrix} m \\ 0 \\ 0 \\ 0 \end{pmatrix},$$

which would be unlikely to be true had we made any trivial error in computing any of the
boost matrices.
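Readers who prefer to let a computer do the algebra can reproduce both checks numerically. The sketch below is ours, assuming Python with NumPy; the helper `boost(axis, v)` and the sample value $v_0 = 0.6$ are illustrative assumptions. It applies the four boosts in the order $+x$, $+y$, $-x$, $-y$, confirms that the star is left "drifting" while still satisfying the mass-shell identity, and then repeats the calculation with the final two boosts interchanged.

```python
import numpy as np

def boost(axis, v):
    """Boost by velocity v along spatial axis 1 (x) or 2 (y), with c = 1."""
    g = 1.0 / np.sqrt(1.0 - v * v)
    B = np.eye(4)
    B[0, 0] = B[axis, axis] = g
    B[0, axis] = B[axis, 0] = -g * v
    return B

X, Y = 1, 2
m, v0 = 1.0, 0.6                          # illustrative values (gamma_0 = 1.25)
star = np.array([m, 0.0, 0.0, 0.0])       # star initially at rest

# Boosts applied in the order +x, +y, -x, -y (rightmost matrix acts first).
p = boost(Y, -v0) @ boost(X, -v0) @ boost(Y, v0) @ boost(X, v0) @ star
mass_sq = p[0] ** 2 - p[1:] @ p[1:]       # should equal m**2

# Same four boosts, but with the final two interchanged.
p_swapped = boost(X, -v0) @ boost(Y, -v0) @ boost(Y, v0) @ boost(X, v0) @ star
```

With these inputs the spatial part of `p` is nonzero even though `mass_sq` equals `m**2`, whereas `p_swapped` coincides with `star`.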

Let us therefore try to find out where our intuition has led us astray in the calculation (11),
by breaking it down into smaller parts. We already know what happens to the components
of the four-momentum of a particle, originally at rest, when we subject our system of
coordinates to a single Lorentz boost, so let us consider instead the effect of the first two boosts
in (11), namely, $B_x(v_0)$ followed by $B_y(v_0)$. If we stop our calculation at this point, we find

$$B_y(v_0)\, B_x(v_0) \begin{pmatrix} m \\ 0 \\ 0 \\ 0 \end{pmatrix} =
\begin{pmatrix} m\gamma_0^2 \\ -m\gamma_0 v_0 \\ -m\gamma_0^2 v_0 \\ 0 \end{pmatrix}. \tag{12}$$

Now, since we have boosted the Enterprise in the positive-x and positive-y directions, we
expect that the star will be moving (relative to the Enterprise) with a negative velocity in
the x- and y-directions; and this is borne out by the result (12). However, we are surprised
to find that the x- and y-velocities are not equal, despite our boosting the Enterprise by
the same velocity $v_0$ in each direction! Indeed, making use of the relation (7) with the
components (12) of $p^\mu$, we find that the components of the three-velocity of the star, relative
to the Enterprise, are given by

$$v_x = -\frac{v_0}{\gamma_0}, \qquad v_y = -v_0. \tag{13}$$

Thus, the second (y) boost has been fully effective, but it has, in the process, reduced the
velocity of the first (x) boost.
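This asymmetry is easy to see numerically. The following sketch is ours, assuming Python with NumPy; `boost(axis, v)` and the value $v_0 = 0.8$ are illustrative. It applies the x-boost and then the y-boost to the star and reads off the velocity components from the ratio of momentum to energy.

```python
import numpy as np

def boost(axis, v):
    """Boost by velocity v along spatial axis 1 (x) or 2 (y), with c = 1."""
    g = 1.0 / np.sqrt(1.0 - v * v)
    B = np.eye(4)
    B[0, 0] = B[axis, axis] = g
    B[0, axis] = B[axis, 0] = -g * v
    return B

X, Y = 1, 2
m, v0 = 1.0, 0.8                          # illustrative values
g0 = 1.0 / np.sqrt(1.0 - v0 * v0)
star = np.array([m, 0.0, 0.0, 0.0])

p_after_x = boost(X, v0) @ star           # after the x-boost only
p_after_xy = boost(Y, v0) @ p_after_x     # after the x-boost and then the y-boost

vx = p_after_xy[1] / p_after_xy[0]        # velocity components from v = p / E
vy = p_after_xy[2] / p_after_xy[0]
```

The x-momentum is the same in `p_after_x` and `p_after_xy`; only the energy grows, so `vx` shrinks to `-v0 / g0` while `vy` equals `-v0`.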

Let us put this unexpected asymmetry to one side, for the moment, and return to our 
first perplexing result, namely, the nonzero velocity represented by Eq. (11). We have found
that the application of $B_x(v_0)$ and then $B_y(v_0)$ to the Enterprise leads to the star having
the velocity components (13) (relative to the Enterprise). Let us now consider the final two
boosts in Eq. (11), namely, $B_x(-v_0)$ followed by $B_y(-v_0)$. Instead of applying them after
the first two boosts, let us instead apply them to the original Enterprise, which was at rest
relative to the star. The effect of these two boosts on the components of the four-momentum
of the star, in this modified scenario, would be

$$B_y(-v_0)\, B_x(-v_0) \begin{pmatrix} m \\ 0 \\ 0 \\ 0 \end{pmatrix} =
\begin{pmatrix} m\gamma_0^2 \\ m\gamma_0 v_0 \\ m\gamma_0^2 v_0 \\ 0 \end{pmatrix}, \tag{14}$$

leading to the velocity components

$$v_x = +\frac{v_0}{\gamma_0}, \qquad v_y = +v_0. \tag{15}$$

Thus, comparing (13) and (15), we find that applying $B_x(-v_0)$ and then $B_y(-v_0)$ results
in the exact opposite velocity to that obtained by applying $B_x(v_0)$ and then $B_y(v_0)$. (We
would, of course, expect that this would be the case; but, given the problems we are having,
it is essential to ensure that we do not make intuitive assumptions without testing them
mathematically.)

We now find that our original result, Eq. (11), has not been clarified in the least. For we
have shown that our sequence of four boosts can be broken down into a boost by the velocity
components (13) (let us, for definiteness, refer to this three-velocity as $\mathbf{v}_{xy}$), followed by a
boost by the velocity components (15) (namely, $-\mathbf{v}_{xy}$). But surely Einstein's very derivation
of the Lorentz transformation guarantees us that a boost by any velocity $\mathbf{v}$, followed by a
boost by $-\mathbf{v}$, must return us to the original inertial frame? How, then, can we make any
sense of the result (11), which seems to imply that

$$B(-\mathbf{v}_{xy})\, B(\mathbf{v}_{xy}) \neq I\,? \tag{16}$$



Let us, again, put this problem to one side, and instead try the following tack: What if
we were to perform four boosts, again in the $+x$, $+y$, $-x$, and $-y$ directions respectively, but
now adjusting the magnitude of each boost velocity so as to maintain some sort of control
over the resultant overall velocity? Let us again start with a boost by velocity $v_0$ in the
positive-x direction:

$$B_x(v_0) \begin{pmatrix} m \\ 0 \\ 0 \\ 0 \end{pmatrix} =
\begin{pmatrix} m\gamma_0 \\ -m\gamma_0 v_0 \\ 0 \\ 0 \end{pmatrix}.$$

We now apply a boost by some velocity $v_1$ (not equal to $v_0$) in the positive-y direction:

$$B_y(v_1)\, B_x(v_0) \begin{pmatrix} m \\ 0 \\ 0 \\ 0 \end{pmatrix} =
\begin{pmatrix} m\gamma_0\gamma_1 \\ -m\gamma_0 v_0 \\ -m\gamma_0\gamma_1 v_1 \\ 0 \end{pmatrix}.$$

Let us now adjust $v_1$ so that the overall velocity has equal components in the x- and
y-directions (as was our original intention). We can do this by ensuring that $p^x$ and $p^y$ are
equal, namely, by insisting that

$$\gamma_1 v_1 = v_0.$$

After some algebra, one finds that this is satisfied for

$$\gamma_1 = \frac{\sqrt{2\gamma_0^2 - 1}}{\gamma_0}, \tag{17}$$

so that

$$B_y(v_1)\, B_x(v_0) \begin{pmatrix} m \\ 0 \\ 0 \\ 0 \end{pmatrix} =
\begin{pmatrix} m\sqrt{2\gamma_0^2 - 1} \\ -m\gamma_0 v_0 \\ -m\gamma_0 v_0 \\ 0 \end{pmatrix}.$$

Let us now apply a boost by velocity $v_2$ in the negative-x direction:

$$B_x(-v_2)\, B_y(v_1)\, B_x(v_0) \begin{pmatrix} m \\ 0 \\ 0 \\ 0 \end{pmatrix} =
\begin{pmatrix} m\gamma_0\gamma_2 (\gamma_1 - v_0 v_2) \\ -m\gamma_0\gamma_2 (v_0 - \gamma_1 v_2) \\ -m\gamma_0 v_0 \\ 0 \end{pmatrix}.$$

We can use this third boost to reduce the x-component of the velocity to zero by choosing

$$v_2 = \frac{v_0}{\gamma_1} = v_1,
\qquad \text{which leaves} \qquad
\begin{pmatrix} m\gamma_0 \\ 0 \\ -m\gamma_0 v_0 \\ 0 \end{pmatrix}.$$

Finally, it is evident that we can apply the fourth boost by the original velocity $v_0$ in the
negative-y direction, resulting in

$$B_y(-v_0)\, B_x(-v_1)\, B_y(v_1)\, B_x(v_0) \begin{pmatrix} m \\ 0 \\ 0 \\ 0 \end{pmatrix} =
\begin{pmatrix} m \\ 0 \\ 0 \\ 0 \end{pmatrix}.$$

We finally seem to have found a sequence of four boosts, in the $+x$, $+y$, $-x$, and $-y$ directions
respectively, that returns the Enterprise to a state of rest relative to the star at the end of
the maneuver: namely, the sequence of boosts

$$B_y(-v_0)\, B_x(-v_1)\, B_y(v_1)\, B_x(v_0), \tag{18}$$

together with the relation (17) between $v_1$ and $v_0$.
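The step-by-step tuning of the boost velocities can be traced numerically. The sketch below is our own, assuming Python with NumPy; the helper `boost(axis, v)`, the value $v_0 = 0.6$, and the closed form used for $\gamma_1$ (obtained by solving $\gamma_1 v_1 = v_0$) are stated assumptions rather than quotations from the text. It records the star's four-momentum after each of the four boosts.

```python
import numpy as np

def boost(axis, v):
    """Boost by velocity v along spatial axis 1 (x) or 2 (y), with c = 1."""
    g = 1.0 / np.sqrt(1.0 - v * v)
    B = np.eye(4)
    B[0, 0] = B[axis, axis] = g
    B[0, axis] = B[axis, 0] = -g * v
    return B

X, Y = 1, 2
m, v0 = 1.0, 0.6                                   # illustrative values
g0 = 1.0 / np.sqrt(1.0 - v0 * v0)
g1 = np.sqrt(2.0 * g0 ** 2 - 1.0) / g0             # solves gamma_1 * v_1 = v_0
v1 = np.sqrt(1.0 - 1.0 / g1 ** 2)

star = np.array([m, 0.0, 0.0, 0.0])
steps = [boost(X, v0), boost(Y, v1), boost(X, -v1), boost(Y, -v0)]

history = []                                       # four-momentum after each boost
p = star.copy()
for B in steps:
    p = B @ p
    history.append(p.copy())
```

After the second boost the x- and y-momenta in `history[1]` are equal, after the third the x-momentum in `history[2]` vanishes, and `history[3]` is the star at rest again.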

Let us now go back in time to our original Enterprise, at rest relative to the nearby star. 
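The crew of the Enterprise had noted that there was a shuttlecraft, of mass $m_s$, moving with
velocity $v_s$ in the positive-x direction; in other words, the components of its four-momentum,
relative to the Enterprise, were

$$\begin{pmatrix} m_s\gamma_s \\ m_s\gamma_s v_s \\ 0 \\ 0 \end{pmatrix}. \tag{19}$$

What happens to the components of the four-momentum of the shuttlecraft after the
sequence of boosts (18)? We would expect that they, like those of the star, would be
unchanged. However, if we perform the calculations, we find that

$$B_y(-v_0)\, B_x(-v_1)\, B_y(v_1)\, B_x(v_0)
\begin{pmatrix} m_s\gamma_s \\ m_s\gamma_s v_s \\ 0 \\ 0 \end{pmatrix} =
\begin{pmatrix}
m_s\gamma_s \\
m_s\gamma_s v_s \sqrt{2\gamma_0^2 - 1}\,/\gamma_0^2 \\
m_s\gamma_s v_s (1 - 1/\gamma_0^2) \\
0
\end{pmatrix}. \tag{20}$$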



We can't seem to take a trick! Even though the sequence of boosts (18) has left the velocity of 
the star unchanged, relative to the Enterprise, it has changed the velocity of the shuttlecraft. 
But how can this be possible? 
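The puzzle can be made concrete with numbers. The following sketch is ours, assuming Python with NumPy; the helper `boost(axis, v)`, the values $v_0 = 0.6$ and $v_s = 0.3$, and the closed form for $\gamma_1$ (from $\gamma_1 v_1 = v_0$) are illustrative assumptions. It applies the closed sequence of four boosts both to the star and to the shuttlecraft.

```python
import numpy as np

def boost(axis, v):
    """Boost by velocity v along spatial axis 1 (x) or 2 (y), with c = 1."""
    g = 1.0 / np.sqrt(1.0 - v * v)
    B = np.eye(4)
    B[0, 0] = B[axis, axis] = g
    B[0, axis] = B[axis, 0] = -g * v
    return B

X, Y = 1, 2
v0 = 0.6                                           # illustrative boost velocity
g0 = 1.0 / np.sqrt(1.0 - v0 * v0)
g1 = np.sqrt(2.0 * g0 ** 2 - 1.0) / g0             # from gamma_1 * v_1 = v_0
v1 = np.sqrt(1.0 - 1.0 / g1 ** 2)

# Closed sequence +x, +y, -x, -y (rightmost matrix acts first).
sequence = boost(Y, -v0) @ boost(X, -v1) @ boost(Y, v1) @ boost(X, v0)

m = 1.0
star = np.array([m, 0.0, 0.0, 0.0])

ms, vs = 1.0, 0.3                                  # illustrative shuttle mass and speed
gs = 1.0 / np.sqrt(1.0 - vs * vs)
shuttle = ms * gs * np.array([1.0, vs, 0.0, 0.0])

star_after = sequence @ star
shuttle_after = sequence @ shuttle
speed_after = np.linalg.norm(shuttle_after[1:]) / shuttle_after[0]
```

The star is unchanged, and the shuttle keeps its energy and its speed, but its momentum acquires a y-component: the direction of motion has turned.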



IV. Solutions to the elementary "paradoxes" 

Let us now discover the fallacies contained in the "paradoxes" described above. We shall
begin by unraveling our chain of arguments, starting with the final "paradox", and working
our way back to the first. By this stage, the reasons for each "paradox" will be clear. We
shall complete this section by listing general expressions for the Thomas rotation in arbitrary
cases.

We begin with the result (20) for the final four-momentum of the shuttlecraft. We were
surprised to find that it differed from the four-momentum (19) of the shuttlecraft prior to the
sequence of boosts. However, on closer inspection, we find that the result is not total chaos.
In particular, the energy of the shuttlecraft has not changed. This, in turn, implies that its
speed is also unchanged: in other words, the final velocity has the same magnitude as the
original velocity, but it has been rotated in space. We can confirm this by using Pythagoras's
theorem to find the resultant of the components $v_x$ and $v_y$ in (20); and we indeed find that
its magnitude is simply $v_s$, the original speed of the shuttlecraft.

This rotation of the spatial axes is what we know as the Thomas rotation. It almost
always occurs when we apply a sequence of non-collinear boosts that returns us to an inertial
frame that is at rest relative to the original frame. This rotation had no net effect on the
four-momentum of the star, because the star was at rest (the "spatial" components of its
four-momentum vanished); in contrast, the motion of the shuttlecraft defines a direction in
space (the x-direction, in the original inertial frame), that was subject to the rotation.

We can obtain a clearer view of this rotation if we compute not the components of
the four-momentum (20), but rather the entire matrix (18) that is applicable to arbitrary
four-vectors in the original frame:

$$B_y(-v_0)\, B_x(-v_1)\, B_y(v_1)\, B_x(v_0) =
\begin{pmatrix}
1 & 0 & 0 & 0 \\
0 & \sqrt{2\gamma_0^2 - 1}\,/\gamma_0^2 & -(1 - 1/\gamma_0^2) & 0 \\
0 & 1 - 1/\gamma_0^2 & \sqrt{2\gamma_0^2 - 1}\,/\gamma_0^2 & 0 \\
0 & 0 & 0 & 1
\end{pmatrix}. \tag{21}$$

The 2x2 matrix in the middle of this result is an orthogonal transformation, resulting in a
rotation of the axes of the x-y plane by an angle

$$\theta = -\sin^{-1}\!\left( 1 - \frac{1}{\gamma_0^2} \right). \tag{22}$$

In the nonrelativistic limit, the magnitude of $\theta$ approaches $v_0^2$ radians (i.e., $v_0^2/c^2$ in
conventional units), and so is completely negligible for terrestrial applications. (Even if $v_0$ is set to
the Earth's orbital velocity around the Sun, the Thomas rotation angle amounts to a mere
0.002 seconds of arc.) In the ultrarelativistic limit, on the other hand, $\theta$ approaches $-90°$
for this particular sequence of boosts.
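The rotation angle and its two limits can be explored numerically. The sketch below is ours, assuming Python with NumPy; the function names, the sample velocities, the closed form for $\gamma_1$ (from $\gamma_1 v_1 = v_0$), the expression $-\sin^{-1}(v_0^2)$ used for comparison (our own reading of the x-y block), and the round figure of 29.8 km/s for the Earth's orbital speed are all stated assumptions.

```python
import numpy as np

def boost(axis, v):
    """Boost by velocity v along spatial axis 1 (x) or 2 (y), with c = 1."""
    g = 1.0 / np.sqrt(1.0 - v * v)
    B = np.eye(4)
    B[0, 0] = B[axis, axis] = g
    B[0, axis] = B[axis, 0] = -g * v
    return B

X, Y = 1, 2

def closed_sequence(v0):
    """Product of the four boosts +x, +y, -x, -y that leaves the star at rest."""
    g0 = 1.0 / np.sqrt(1.0 - v0 * v0)
    g1 = np.sqrt(2.0 * g0 ** 2 - 1.0) / g0         # from gamma_1 * v_1 = v_0
    v1 = np.sqrt(1.0 - 1.0 / g1 ** 2)
    return boost(Y, -v0) @ boost(X, -v1) @ boost(Y, v1) @ boost(X, v0)

def thomas_angle(v0):
    """Rotation angle of the axes (radians), read off from the x-y block."""
    L = closed_sequence(v0)
    return -np.arctan2(L[2, 1], L[1, 1])

L = closed_sequence(0.6)
block = L[1:3, 1:3]                                # the 2x2 block in the x-y plane
theta = thomas_angle(0.6)

theta_slow = thomas_angle(1.0e-3)                  # nonrelativistic regime
theta_fast_deg = np.degrees(thomas_angle(0.9999))  # ultrarelativistic regime

# Earth's orbital speed, taken here as roughly 29.8 km/s (an assumed round figure).
beta_earth = 29.8e3 / 299792458.0
earth_arcsec = np.degrees(np.arcsin(beta_earth ** 2)) * 3600.0
```

For small velocities `theta_slow` is close to minus the square of the velocity, and `theta_fast_deg` closes in on $-90°$ as the velocity approaches that of light.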

Defining the direction of the Thomas rotation, however, requires some care. Let us 
consider the above sequence of boosts from the point of view of an inertial observer, jettisoned 
from the Enterprise before the sequence of boosts commenced, who remained at rest relative 
to the star (and the distant "fixed stars" ) throughout the procedure. Relative to this fixed 
observer, the Enterprise's velocity rotated in the direction $+x \to +y$. The velocity of the
shuttlecraft, as seen by the Enterprise, was rotated in this same direction. This means that,
relative to the fixed observer, the axes of the Enterprise's coordinate system rotated in the
opposite direction to its orbital rotation, as indicated by the minus sign in Eq. (22). This is
a general feature of the Thomas rotation. 

Nonrelativistic physics has conditioned us to assume that Cartesian coordinate systems 
can be defined in space, in such a way that all inertial observers "agree" on the directions of 
the axes. The Thomas rotation demonstrates that this assumption requires an operational 
definition, as Einstein showed was necessary to clarify our understanding of the physics of 
relativity. For example, say that observer A defines a set of Cartesian axes. If observer B 
is at rest relative to A, then B can align her axes to "agree" with those of A. If observer 
A remains at rest, but observer B is boosted to some finite velocity relative to A, by one 
boost or by a sequence of boosts, then the resultant orientation of B's axes depends on the
particular sequence of boosts used. If such boosts are at all times in the same direction
(relative to A, say), then it is meaningful to say that B's axes are still aligned with A's, in
the sense that if we apply any sequence of boosts to B that is at all times collinear with
this direction, that returns B to rest with respect to A, then their axes will be found to still
point in the same directions. On the other hand, if B is, at any two times, subject to boosts
in different directions, then a sequence of boosts bringing B back to rest relative to A will,
in general, lead to B finding her axes rotated relative to A's (unless the sequence of boosts
"backtracked" precisely the original sequence).




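To discuss the general case we need the expression for a (simple) boost by velocity $|\mathbf{v}|$
in the direction of $\mathbf{v}$. We can obtain it most simply by noting that our original boost
transformation along the x-axis, $B_x(v)$ of Eq. (4), is an archetypical simple boost. If we
rewrite $B_x(v)$ in three-covariant notation (i.e., in terms of three-vectors and three-vector
operations, rather than individual components), then we know from vector analysis that the
result will be the boost we require for $\mathbf{v}$ in an arbitrary direction. It is now straightforward
to verify that^

$$t' = \gamma t - \gamma (\mathbf{v}\cdot\mathbf{x}), \qquad
\mathbf{x}' = \mathbf{x} - \gamma t\, \mathbf{v} + \frac{\gamma^2}{\gamma + 1} (\mathbf{v}\cdot\mathbf{x})\, \mathbf{v}, \tag{23}$$

is equivalent to (4) for $\mathbf{v} = (v, 0, 0)$; and thus (23) is the simple boost operation $B(\mathbf{v})$ that
we are seeking. (It is a straightforward calculation to confirm that the component of $\mathbf{x}$ in
the direction of $\mathbf{v}$ satisfies the usual Lorentz transformation:

$$\frac{\mathbf{v}\cdot\mathbf{x}'}{v} = \gamma \left( \frac{\mathbf{v}\cdot\mathbf{x}}{v} - v t \right),$$

and that the components of $\mathbf{x}$ normal to the velocity $\mathbf{v}$ are unchanged:

$$\mathbf{v}\times\mathbf{x}' = \mathbf{v}\times\mathbf{x},$$

since the component of $\mathbf{x}$ or $\mathbf{x}'$ in the direction of $\mathbf{v}$ does not contribute to the cross product.)
Written out in matrix form, we have

$$B(\mathbf{v}) =
\begin{pmatrix}
\gamma & -\gamma v_x & -\gamma v_y & -\gamma v_z \\
-\gamma v_x & 1 + \dfrac{\gamma^2 v_x^2}{\gamma + 1} & \dfrac{\gamma^2 v_x v_y}{\gamma + 1} & \dfrac{\gamma^2 v_x v_z}{\gamma + 1} \\
-\gamma v_y & \dfrac{\gamma^2 v_y v_x}{\gamma + 1} & 1 + \dfrac{\gamma^2 v_y^2}{\gamma + 1} & \dfrac{\gamma^2 v_y v_z}{\gamma + 1} \\
-\gamma v_z & \dfrac{\gamma^2 v_z v_x}{\gamma + 1} & \dfrac{\gamma^2 v_z v_y}{\gamma + 1} & 1 + \dfrac{\gamma^2 v_z^2}{\gamma + 1}
\end{pmatrix}. \tag{24}$$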



A sequence of two such boosts, which are in different directions, is not a simple boost, but
is rather a combination of a rotation and a simple boost. If we consider $B(\mathbf{v}_1)$ followed by
$B(\mathbf{v}_2)$, and denote the velocity of the boost implied by the composite transformation as $\mathbf{v}_{12}$,
then the mathematical expression of this observation is that

$$B(\mathbf{v}_2)\, B(\mathbf{v}_1) = B(\mathbf{v}_{12})\, R(\mathbf{v}_1, \mathbf{v}_2),$$

where $R(\mathbf{v}_1, \mathbf{v}_2)$ is a spatial rotation, depending on the velocities $\mathbf{v}_1$ and $\mathbf{v}_2$.
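This decomposition can be carried out numerically for any pair of velocities. The sketch below is our own, assuming Python with NumPy; the function `boost(v)` implements the general simple boost in matrix form, and the two sample velocities are arbitrary illustrative choices. The composite velocity is read off from the action of the product on a particle at rest, and the boost part is then stripped off, leaving the rotation.

```python
import numpy as np

def boost(v):
    """General simple boost B(v) for a three-velocity v, with c = 1."""
    v = np.asarray(v, dtype=float)
    g = 1.0 / np.sqrt(1.0 - v @ v)
    B = np.eye(4)
    B[0, 0] = g
    B[0, 1:] = B[1:, 0] = -g * v
    B[1:, 1:] += g ** 2 / (g + 1.0) * np.outer(v, v)
    return B

v1 = np.array([0.5, 0.1, -0.2])           # illustrative, non-collinear velocities
v2 = np.array([-0.3, 0.4, 0.2])

L = boost(v2) @ boost(v1)                 # B(v1) followed by B(v2)

# A particle at rest acquires velocity -v12 under the composite transformation.
v12 = -L[1:, 0] / L[0, 0]

# Strip off the boost part; the inverse of B(v12) is B(-v12).
R = boost(-v12) @ L
R3 = R[1:, 1:]                            # spatial 3x3 block
```

The matrix `R` leaves the time component alone, and `R3` is orthogonal with unit determinant, so what remains after removing the boost is a pure spatial rotation.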

Let us now return to the "unexpected asymmetry" in the result (13), namely, the fact
that a boost by the velocity $v_0$ in the x-direction, followed by a boost by $v_0$ in the y-direction,
leads to $v_x \neq v_y$ relative to the original frame of reference. We can understand the result
$v_y = -v_0$ by the following argument: Imagine that, after the x-boost, there is an object that
is observed to be at rest. Applying the y-boost to ourselves, we have no choice but to observe
this object moving with velocity $-v_0$ in the y-direction. This same argument must apply to
any object in the original frame which had no velocity in the y-direction.

The reduction in the x-velocity by the y-boost seems counterintuitive, but a little thought
makes sense of it. We know, from Einstein's ingenious arguments, that lengths of rods
perpendicular to a velocity vector are unchanged by the relative motion. But lengths are nothing
more than differences in positions; and positions are themselves the spatial components of
the four-vector $x^\mu$. Thus, taking into account the universality of Lorentz covariance,
Einstein's arguments imply that, for any four-vector, the spatial components perpendicular to
the boost velocity are unchanged by the boost, as can be verified from Eq. (23). But the
spatial momentum components are simply the spatial components of the four-vector $p^\mu$;
therefore, $p^x$ and $p^z$ must be unaltered by a y-boost. And, indeed, the result (12) shows us
that the x-momentum of the star was unchanged by the second boost: it remained $-m\gamma_0 v_0$.
Rather, the energy of the star increased (due to its new y-velocity); and hence, by Eq. (7),
its x-velocity decreased. To have an object maintain its momentum, but lose velocity, is
nonrelativistically counterintuitive; but one can make sense of it by remembering that all
velocities must remain smaller than that of light, and so for a large enough boost in the
y-direction, any original velocity in the x-direction must be "quenched" (although not its
momentum!).

This asymmetry tells us that the non-commutativity of two non-collinear boosts is more
complicated than is widely appreciated: one not only finds a relative Thomas rotation
between the two resulting frames, but furthermore the resulting frames are not even moving
with the same velocity. This is the source of the result expressed in Eq. (11). There we
considered a sequence of four boosts which we naively expected to return us to our initial state
of motion. However, the first two boosts do not combine to give a pure Lorentz boost, but
rather involve a Thomas rotation. This rotation is not compensated by the later boosts;
indeed, there is a further rotation in the same direction. Thus, we should not be surprised
that the sequence of four boosts gives the counterintuitive result of Eq. (11).

This fundamental asymmetry is "hidden" in many introductory accounts of the addition 
of non-collinear velocities, by means of a judicious mixing of an active transformation for one
velocity (i.e., the object is considered to be boosted, with we as observers being kept at rest) 
together with a passive transformation for the other (i.e., we as observers are being boosted), 
rather than two successive passive transformations as used in this paper. This "trick" gives 
the illusion of a greater degree of symmetry than is generally the case. (Einstein's seminal 
1905 paper uses this "trick".) 

All of these various points must be kept in mind if one wishes to analyze Thomas rotations 
in full generality. Any "closed" sequence of finite boosts (i.e., that returns us to a frame at 
rest relative to the original frame) will, in general, result in a Thomas rotation. Any such 
closed sequence may be broken down into a succession of closed sequences, each consisting 
of three boosts, in the same way that any arbitrary polygon (not necessarily planar) can be 
broken down into a "triangular mesh" by the addition of internal edges. Thus, the basic 
"building block" of a finite Thomas rotation is a sequence of three pure boosts: the first two 
are arbitrary, and the third must be chosen so as to make the sequence "closed". The first 
two velocities, then, determine the Thomas rotation for this "building block". Complicating
such calculations, however, is the fact that the "sum" $\mathbf{v}_{12}$ of two arbitrary velocities $\mathbf{v}_1$ and
$\mathbf{v}_2$ is, in the general case, quite a complicated function of the first two velocities:

$$\mathbf{v}_{12} = \frac{1}{\gamma_2 (1 + \mathbf{v}_1\cdot\mathbf{v}_2)}
\left[ \mathbf{v}_1 + \gamma_2 \mathbf{v}_2 + \frac{\gamma_2^2}{\gamma_2 + 1} (\mathbf{v}_1\cdot\mathbf{v}_2)\, \mathbf{v}_2 \right]. \tag{25}$$

(We can clearly see here the asymmetry between the two velocities $\mathbf{v}_1$ and $\mathbf{v}_2$; it is only if $\mathbf{v}_1$
and $\mathbf{v}_2$ are collinear that (25) becomes symmetrical under their interchange, and reproduces
the usual formula (9) for the relativistic addition of velocities, as a short calculation shows.)
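The composition formula can be compared directly against the product of the two boost matrices. The following sketch is ours, assuming Python with NumPy; `boost`, `compose`, and the sample velocities are illustrative names and values, and `compose` encodes our reading of the formula for $\mathbf{v}_{12}$.

```python
import numpy as np

def boost(v):
    """General simple boost B(v) for a three-velocity v, with c = 1."""
    v = np.asarray(v, dtype=float)
    g = 1.0 / np.sqrt(1.0 - v @ v)
    B = np.eye(4)
    B[0, 0] = g
    B[0, 1:] = B[1:, 0] = -g * v
    B[1:, 1:] += g ** 2 / (g + 1.0) * np.outer(v, v)
    return B

def compose(v1, v2):
    """Velocity v12 of the boost implied by B(v1) followed by B(v2)."""
    v1 = np.asarray(v1, dtype=float)
    v2 = np.asarray(v2, dtype=float)
    g2 = 1.0 / np.sqrt(1.0 - v2 @ v2)
    d = v1 @ v2
    return (v1 + g2 * v2 + g2 ** 2 / (g2 + 1.0) * d * v2) / (g2 * (1.0 + d))

v1 = np.array([0.5, 0.1, -0.2])           # illustrative velocities
v2 = np.array([-0.3, 0.4, 0.2])

L = boost(v2) @ boost(v1)
v12_matrix = -L[1:, 0] / L[0, 0]          # read off from a particle at rest
v12_formula = compose(v1, v2)
v21_formula = compose(v2, v1)             # the two velocities interchanged

# Collinear case: reduces to the usual one-dimensional addition law.
collinear = compose([0.6, 0.0, 0.0], [0.7, 0.0, 0.0])
```

The two orderings give velocities of equal magnitude but different direction, which is the asymmetry noted above.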
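The expression for the Thomas rotation is, in turn, even more complicated. Let us assume
that we have an arbitrary three-vector $\mathbf{r}$ in our initial frame. After the sequence of pure
boosts $B(\mathbf{v}_1)$, $B(\mathbf{v}_2)$, and $B(-\mathbf{v}_{12})$, the three-vector $\mathbf{r}$ is rotated to

$$\mathbf{r}' = \mathbf{r} + \frac{\gamma_1\gamma_2 (\mathbf{v}_1\times\mathbf{v}_2)\times\mathbf{r} - \mathbf{Q}}{1 + \gamma_1\gamma_2 (1 + \mathbf{v}_1\cdot\mathbf{v}_2)}, \tag{26a}$$

where

$$\mathbf{Q} = \frac{\gamma_1^2 (\gamma_2^2 - 1)(\mathbf{v}_1\cdot\mathbf{r})\, \mathbf{v}_1
+ \gamma_2^2 (\gamma_1^2 - 1)(\mathbf{v}_2\cdot\mathbf{r})\, \mathbf{v}_2
- 2\gamma_1^2\gamma_2^2 (\mathbf{v}_1\cdot\mathbf{v}_2)(\mathbf{v}_1\cdot\mathbf{r})\, \mathbf{v}_2}
{(\gamma_1 + 1)(\gamma_2 + 1)}. \tag{26b}$$

(Again, $\mathbf{Q}$ has no particular symmetry under the interchange of $\mathbf{v}_1$ and $\mathbf{v}_2$.) It can be verified,
after some algebra, that $\mathbf{r}'^2 = \mathbf{r}^2$, i.e., that $\mathbf{r}'$ is indeed simply a rotation of $\mathbf{r}$ in three-space.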
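Rather than verifying these expressions by hand, one can test them against the explicit product of three boost matrices. The sketch below is our own, assuming Python with NumPy; `boost`, `thomas_rotate`, and the sample vectors are illustrative, and `thomas_rotate` encodes our reading of the closed-form rotation (the cross-product term and the vector $\mathbf{Q}$).

```python
import numpy as np

def gamma(v):
    """Lorentz factor for a three-velocity v, with c = 1."""
    return 1.0 / np.sqrt(1.0 - v @ v)

def boost(v):
    """General simple boost B(v) for a three-velocity v."""
    v = np.asarray(v, dtype=float)
    g = gamma(v)
    B = np.eye(4)
    B[0, 0] = g
    B[0, 1:] = B[1:, 0] = -g * v
    B[1:, 1:] += g ** 2 / (g + 1.0) * np.outer(v, v)
    return B

def thomas_rotate(r, v1, v2):
    """Closed-form rotation of r after the boosts B(v1), B(v2), B(-v12)."""
    g1, g2 = gamma(v1), gamma(v2)
    d = v1 @ v2
    Q = (g1 ** 2 * (g2 ** 2 - 1.0) * (v1 @ r) * v1
         + g2 ** 2 * (g1 ** 2 - 1.0) * (v2 @ r) * v2
         - 2.0 * g1 ** 2 * g2 ** 2 * d * (v1 @ r) * v2) / ((g1 + 1.0) * (g2 + 1.0))
    return r + (g1 * g2 * np.cross(np.cross(v1, v2), r) - Q) / (1.0 + g1 * g2 * (1.0 + d))

v1 = np.array([0.5, 0.1, -0.2])           # illustrative velocities and vector
v2 = np.array([-0.3, 0.4, 0.2])
r = np.array([0.3, -1.2, 0.5])

L = boost(v2) @ boost(v1)
v12 = -L[1:, 0] / L[0, 0]                 # composite boost velocity
R = boost(-v12) @ L                       # closed sequence of three boosts

r_matrix = R[1:, 1:] @ r                  # rotation via explicit matrices
r_formula = thomas_rotate(r, v1, v2)      # rotation via the closed form
```

The two results should agree to rounding error, and both have the same length as `r`.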
If V2 is small (but Vi arbitrarily large), then the expression Q of Eq. (26b) is of order v\, and 
hence is negligible in the context of Eq. (26a). If we are considering the continuous Thomas 
precession, then we can set Vi — v and V2 — Sv2- Then, to quantities of first order, Eq. (25) 
yields 

6v = V12— Vi — 6v2 — {v-6v2)v. 

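Thus, if $\delta\mathbf{v}_2$ is perpendicular to the velocity $\mathbf{v}$, then $\delta\mathbf{v} = \delta\mathbf{v}_2$; but if $\delta\mathbf{v}_2$ is parallel to $\mathbf{v}$,
then one must take into account the fact that the velocity must remain smaller than that of
light. On the other hand, in all cases we have $\mathbf{v}\times\delta\mathbf{v} = \mathbf{v}\times\delta\mathbf{v}_2$, so that Eq. (26a) yields, to
first order,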

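$$\delta\mathbf{r} = \mathbf{r}' - \mathbf{r} = \frac{\gamma}{\gamma + 1}\,(\mathbf{v}\times\delta\mathbf{v})\times\mathbf{r}, \tag{27}$$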

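which is the standard expression for the Thomas precession. (To compare with Jackson's
result following his (11.117), note that our $\delta\mathbf{v}$ is his $\Delta\boldsymbol{\beta}$, and that our $\mathbf{v}\times\delta\mathbf{v}$ is, in his
notation, $\boldsymbol{\beta}\times\Delta\boldsymbol{\beta} = \gamma\,\boldsymbol{\beta}\times\delta\boldsymbol{\beta}$.)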
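
Equations (25) to (27) are easy to check numerically. The following is a minimal sketch, assuming units with $c = 1$ and NumPy; the function names `gamma`, `compose` and `thomas_rotate` are illustrative choices, not notation from the text. It evaluates the closed-form expressions for a pair of perpendicular boosts and then compares the exact rotation with the first-order result (27) for a small second boost.

```python
import numpy as np

def gamma(v):
    """Lorentz factor for a three-velocity v (units with c = 1)."""
    return 1.0 / np.sqrt(1.0 - np.dot(v, v))

def compose(v1, v2):
    """Eq. (25): the 'sum' v12 of the velocities v1 and v2."""
    g2 = gamma(v2)
    d = np.dot(v1, v2)
    return (v1 + g2 * v2 + g2**2 / (g2 + 1.0) * d * v2) / (g2 * (1.0 + d))

def thomas_rotate(v1, v2, r):
    """Eqs. (26a), (26b): r' after the boosts B(v1), B(v2), B(-v12)."""
    g1, g2 = gamma(v1), gamma(v2)
    d = np.dot(v1, v2)
    a, b = np.dot(v1, r), np.dot(v2, r)
    Q = (g1**2 * (g2**2 - 1.0) * a * v1
         + g2**2 * (g1**2 - 1.0) * b * v2
         - 2.0 * g1**2 * g2**2 * d * a * v2) / ((g1 + 1.0) * (g2 + 1.0))
    cross_term = g1 * g2 * np.cross(np.cross(v1, v2), r)
    return r + (cross_term - Q) / (1.0 + g1 * g2 * (1.0 + d))

# Finite rotation: perpendicular boosts of 0.6 (along x) and 0.8 (along y).
r_rot = thomas_rotate(np.array([0.6, 0.0, 0.0]),
                      np.array([0.0, 0.8, 0.0]),
                      np.array([0.0, 1.0, 0.0]))
print(r_rot, np.linalg.norm(r_rot))   # the length of r is preserved

# Infinitesimal limit: compare the exact change in r with Eq. (27).
v = np.array([0.6, 0.0, 0.3])
dv2 = 1e-7 * np.array([0.2, 0.5, -0.1])
r = np.array([0.3, -0.4, 0.5])
dv = compose(v, dv2) - v
dr_exact = thomas_rotate(v, dv2, r) - r
g = gamma(v)
dr_first_order = g / (g + 1.0) * np.cross(np.cross(v, dv), r)
rel_error = np.linalg.norm(dr_exact - dr_first_order) / np.linalg.norm(dr_first_order)
print(rel_error)                      # small: second order in dv2 plus round-off
```

The same two functions can be applied to the velocities used in Sec. VI, $(-2/3, 0, 2/3)$ and $(12/13, 0, 0)$, to reproduce the values quoted there.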

If we now consider the ultrarelativistic limit of Eq. (27), then we find something remark- 
able. This limit may be taken to be defined by the relations 

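$$\gamma \to \infty, \qquad v^2 \to 1, \qquad \mathbf{v}\cdot\delta\mathbf{v} \to 0, \tag{28}$$

the latter two of which simply reflect the fact that the velocity is at all times almost the
speed of light, and (hence) that any changes $\delta\mathbf{v}$ to the velocity $\mathbf{v}$ must be perpendicular
to $\mathbf{v}$. In this limit, Eq. (27) becomes

$$\delta\mathbf{r} \to (\mathbf{v}\times\delta\mathbf{v})\times\mathbf{r}. \tag{29}$$

Consider, now, the expression $(\mathbf{v}\times\delta\mathbf{v})\times\mathbf{v}$. By a standard three-vector identity, we have

$$(\mathbf{v}\times\delta\mathbf{v})\times\mathbf{v} = v^2\,\delta\mathbf{v} - (\mathbf{v}\cdot\delta\mathbf{v})\,\mathbf{v},$$

which, on account of the relations (28), tells us that, in the ultrarelativistic limit,

$$\delta\mathbf{v} \to (\mathbf{v}\times\delta\mathbf{v})\times\mathbf{v}. \tag{30}$$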

Comparing (29) and (30), we thus find that $\mathbf{r}$ and $\mathbf{v}$ are being rotated by the same amount
about the same axis. Recalling our discussion above that the direction of the Thomas
rotation of the axes is opposite to the rotation of $\mathbf{r}$ relative to these axes, we therefore find
that we have proved the following remarkable theorem: For any ultrarelativistic object, the 
Thomas rotation is equal and opposite to the orbital rotation. 

This theorem explains why we obtained a rotation angle of 90° for our sequence of four 
boosts in the ultrarelativistic limit. For we can think of any finite boost as simply a sequence 
of infinitesimal boosts in the same direction. For our first ($+x$) boost, we simply boosted
the Enterprise's velocity to ultrarelativistic speeds. The second ($+y$) boost was designed to
bring the Enterprise's velocity around to a 45° angle between the $+x$ and $+y$ directions; and
the third ($-x$) boost to bring it around another 45° to the $+y$ direction. The final ($-y$) boost
was antiparallel to this velocity, and simply brought the Enterprise back to rest. Thus, the 
velocity of the Enterprise, relative to a fixed observer, was rotated by 90° at ultrarelativistic 
speeds; and hence, by the above theorem, the Thomas rotation is just 90°, which is what we 
found by elementary means above. 



V. An advanced "paradox": polarization properties of scattering events 

Let us now consider a more advanced situation: the calculation of a polarized cross- 
section in quantum field theory. For simplicity, let us consider the scattering of a Dirac 
electron by the (idealized) fixed Coulomb field of an infinitely heavy, pointlike nucleus. For 
definiteness, we shall follow the notation and conventions employed in the introductory 
textbook by Mandl and Shaw. In any frame for which the scattered electron momentum $\mathbf{p}'$
has the same magnitude as the incident momentum $\mathbf{p}$ (i.e., for which the electron's energy
is unchanged by the scattering), the fully polarized cross-section is given by

$$\left(\frac{d\sigma}{d\Omega'}\right)_{rs} = \left(\frac{m}{2\pi}\right)^2 \left|\mathcal{M}_{rs}\right|^2 = \left(\frac{m}{2\pi}\right)^2 \left| e\,\bar{u}_s(\mathbf{p}')\,\not{A}_e(\mathbf{q})\,u_r(\mathbf{p}) \right|^2, \tag{31}$$

where $m$ is the mass and $-e$ the charge of the electron, $\mathcal{M}_{rs}$ is the Feynman amplitude for
the process, $\mathbf{q} = \mathbf{p}' - \mathbf{p}$ is the momentum transfer, $A_e(\mathbf{q})$ is the "external" electromagnetic
field (i.e., the Coulomb field of the nucleus) in momentum space, and $\not{A}_e = A_{e\mu}\gamma^\mu$, where $\gamma^\mu$
are the Dirac gamma matrices (not to be confused with the factor $\gamma$ defined in Eq. (2)).
The indices $r$ and $s$ ($= 1, 2$) label the two possible spin states of the incident and scattered
electron respectively.

Let us first calculate all of the polarized cross-sections for the following scenario. We 
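choose an inertial frame in which the nucleus is at rest, so that

$$A_e^\mu(\mathbf{x}) = \left(\frac{Ze}{4\pi|\mathbf{x}|},\, 0,\, 0,\, 0\right),$$

which under a Fourier transform yields

$$A_e^\mu(\mathbf{q}) = \frac{Ze}{|\mathbf{q}|^2}\,(1, 0, 0, 0),$$

where $Ze$ is the charge of the nucleus. We then have

$$\left(\frac{d\sigma}{d\Omega'}\right)_{rs} = \frac{(2m\alpha Z)^2}{|\mathbf{q}|^4}\,\left|\bar{u}_s(\mathbf{p}')\,\gamma^0\,u_r(\mathbf{p})\right|^2, \tag{32}$$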



where $\alpha = e^2/4\pi$ is the fine-structure constant. Let us consider the case when the incident
electron has velocity components

$$v_x = -\tfrac{2}{3}, \qquad v_y = 0, \qquad v_z = +\tfrac{2}{3},$$

and the scattered electron has velocity components

$$v_x = +\tfrac{2}{3}, \qquad v_y = 0, \qquad v_z = +\tfrac{2}{3},$$

so that the electron is being scattered by 90° in the $z$-$x$ plane. From Eq. (2) we find that
$\gamma = 3$, so that the incident and scattered four-momenta have the components

$$p^\mu = \begin{pmatrix} 3m \\ -2m \\ 0 \\ 2m \end{pmatrix}, \qquad p'^\mu = \begin{pmatrix} 3m \\ 2m \\ 0 \\ 2m \end{pmatrix},$$

and $|\mathbf{q}| = 4m$. Now, the positive-energy spin-momentum eigenstates are, in the Dirac-Pauli
representation of the Dirac matrices, given by



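$$u_1(\mathbf{p}) = c_1 \begin{pmatrix} 1 \\ 0 \\ c_2 p^z \\ c_2(p^x + ip^y) \end{pmatrix}, \qquad u_2(\mathbf{p}) = c_1 \begin{pmatrix} 0 \\ 1 \\ c_2(p^x - ip^y) \\ -c_2 p^z \end{pmatrix}, \tag{33}$$

where

$$c_1 = \sqrt{\frac{E + m}{2m}}, \qquad c_2 = \frac{1}{E + m}, \tag{34}$$

and where $u_1$ ($u_2$) is the spin-up (spin-down) eigenstate relative to the $z$-direction. The conjugate
bispinor eigenstates, in this representation, are consequently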



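$$\bar{u}_1(\mathbf{p}) = c_1 \begin{pmatrix} 1 & 0 & -c_2 p^z & c_2(-p^x + ip^y) \end{pmatrix}, \qquad \bar{u}_2(\mathbf{p}) = c_1 \begin{pmatrix} 0 & 1 & -c_2(p^x + ip^y) & c_2 p^z \end{pmatrix}. \tag{35}$$

For our particular case, $E = 3m$, so $c_1 = \sqrt{2}$ and $c_2 = 1/4m$. For the incident electron we
therefore have

$$u_1(\mathbf{p}) = \frac{1}{\sqrt{2}} \begin{pmatrix} 2 \\ 0 \\ 1 \\ -1 \end{pmatrix}, \qquad u_2(\mathbf{p}) = \frac{1}{\sqrt{2}} \begin{pmatrix} 0 \\ 2 \\ -1 \\ -1 \end{pmatrix},$$

and for the scattered electron we have

$$\bar{u}_1(\mathbf{p}') = \frac{1}{\sqrt{2}} \begin{pmatrix} 2 & 0 & -1 & -1 \end{pmatrix}, \qquad \bar{u}_2(\mathbf{p}') = \frac{1}{\sqrt{2}} \begin{pmatrix} 0 & 2 & -1 & 1 \end{pmatrix}.$$

Let us now compute the cross-section (32). The quantity $\bar{u}_s(\mathbf{p}')\gamma^0 u_r(\mathbf{p})$ is equal to 2 for no
spin flip in the $z$-direction (i.e., for $\bar{u}_1$ with $u_1$, or for $\bar{u}_2$ with $u_2$), and is equal to $\pm 1$ for spin
flip in the $z$-direction (i.e., for $\bar{u}_1$ with $u_2$, or for $\bar{u}_2$ with $u_1$). We thus find that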



$$\frac{d\sigma}{d\Omega}(\text{no spin flip}) = \frac{\alpha^2 Z^2}{16 m^2}, \qquad \frac{d\sigma}{d\Omega}(\text{spin flip}) = \frac{\alpha^2 Z^2}{64 m^2}. \tag{36}$$



To obtain the unpolarized cross-section, we average over the initial spin states and sum over 
the final spin states in the standard way. This results in 



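$$\frac{d\sigma}{d\Omega}(\text{unpolarized}) = \frac{1}{2}\sum_{r=1}^{2}\sum_{s=1}^{2}\left(\frac{d\sigma}{d\Omega}\right)_{rs} = \frac{d\sigma}{d\Omega}(\text{no spin flip}) + \frac{d\sigma}{d\Omega}(\text{spin flip}) = \frac{5\alpha^2 Z^2}{64 m^2}.$$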



We can compare this result with the standard Mott scattering formula, 



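$$\frac{d\sigma}{d\Omega}(\text{Mott}) = \frac{(\alpha Z)^2}{4E^2 v^4 \sin^4(\theta/2)}\left[1 - v^2\sin^2(\theta/2)\right],$$

by noting that, for our case, $\theta = 90^\circ$ so $\theta/2 = 45^\circ$ and hence $\sin^2(\theta/2) = 1/2$; $v^2 = 8/9$; and
$E = 3m$; which yields precisely the same result:

$$\frac{d\sigma}{d\Omega}(\text{Mott}) = \frac{5\alpha^2 Z^2}{64 m^2}.$$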



We may therefore be confident that we have not made any elementary mistakes in calculating 
the polarized cross-sections (36). 
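
The rest-frame numbers can be reproduced in a few lines. This is a minimal sketch, assuming NumPy, units with $m = 1$, and cross-sections expressed in units of $\alpha^2Z^2/m^2$; the helper names `spinors` and `bar` are illustrative. It builds the spinors of Eq. (33), evaluates Eq. (32) for all four spin combinations, and compares the unpolarized sum with the Mott formula.

```python
import numpy as np

G0 = np.diag([1.0, 1.0, -1.0, -1.0])   # gamma^0 in the Dirac-Pauli representation

def spinors(E, p, m=1.0):
    """Return (u_1, u_2) of Eq. (33) for energy E and three-momentum p."""
    c1 = np.sqrt((E + m) / (2.0 * m))
    c2 = 1.0 / (E + m)
    px, py, pz = p
    u1 = c1 * np.array([1.0, 0.0, c2 * pz, c2 * (px + 1j * py)])
    u2 = c1 * np.array([0.0, 1.0, c2 * (px - 1j * py), -c2 * pz])
    return u1, u2

def bar(u):
    """Conjugate bispinor u-bar = u^dagger gamma^0, as in Eq. (35)."""
    return u.conj() @ G0

m, E = 1.0, 3.0
p_in = np.array([-2.0, 0.0, 2.0])      # incident three-momentum
p_out = np.array([2.0, 0.0, 2.0])      # scattered three-momentum
q2 = np.sum((p_out - p_in) ** 2)       # |q|^2 = 16 m^2

# amp[r, s] = u-bar_s(p') gamma^0 u_r(p): row = initial spin, column = final spin
amp = np.array([[bar(us) @ G0 @ ur for us in spinors(E, p_out, m)]
                for ur in spinors(E, p_in, m)])

# Eq. (32), in units of alpha^2 Z^2 / m^2
dsigma = (2.0 * m) ** 2 / q2 ** 2 * np.abs(amp) ** 2
unpolarized = 0.5 * dsigma.sum()       # average over initial, sum over final spins

# Mott formula for the same kinematics
v2 = p_in @ p_in / E ** 2
sin2_half = 0.5 * (1.0 - p_in @ p_out / (p_in @ p_in))
mott = (1.0 - v2 * sin2_half) / (4.0 * E ** 2 * v2 ** 2 * sin2_half ** 2)

print(dsigma)
print(unpolarized, mott)
```

The sign of each spin-flip amplitude depends on phase and ordering conventions, so only the squared magnitudes are compared here.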

Let us now compute these cross-sections from the point of view of a different inertial 
frame. Specifically, let us view the process from an inertial frame which moves along the 
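positive $z$-axis with velocity 2/3 relative to the inertial frame used above. Applying $B_z(2/3)$
to the components of $p^\mu$ and $p'^\mu$, we find

$$B_z(2/3)\begin{pmatrix} 3m \\ -2m \\ 0 \\ 2m \end{pmatrix} = \begin{pmatrix} m\sqrt{5} \\ -2m \\ 0 \\ 0 \end{pmatrix}, \qquad B_z(2/3)\begin{pmatrix} 3m \\ 2m \\ 0 \\ 2m \end{pmatrix} = \begin{pmatrix} m\sqrt{5} \\ 2m \\ 0 \\ 0 \end{pmatrix}, \tag{37}$$

so that, from the point of view of this new frame, the electron travels in the negative-$x$
direction with energy $E = m\sqrt{5}$ and speed $2/\sqrt{5}$, and is then reflected elastically to travel
in the positive-$x$ direction with the same energy and speed. We also need to boost the
components of the four-potential $A_e^\mu$:

$$A_e^\mu(\mathbf{q}) = \frac{Ze}{\sqrt{5}\,|\mathbf{q}|^2}\,(3, 0, 0, -2),$$

so that the equivalent expression to (32) for the polarized cross-section is

$$\left(\frac{d\sigma}{d\Omega'}\right)_{rs} = \frac{(2m\alpha Z)^2}{5|\mathbf{q}|^4}\,\left|\bar{u}_s(\mathbf{p}')\left(3\gamma^0 + 2\gamma^3\right)u_r(\mathbf{p})\right|^2. \tag{38}$$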



(We would, in general, need to transform the argument $\mathbf{q}$ as well as the components $A_e^\mu$
under a Lorentz transformation. However, if we define $q^\mu \equiv p'^\mu - p^\mu$, then in the original
frame $|\mathbf{q}|^2 = -q^\mu q_\mu$ because $q^0 = 0$, i.e., the electron energy is conserved. Since $q^\mu q_\mu$ is a
Lorentz scalar, we find that $|\mathbf{q}|^2$ is invariant in any frame in which the electron energy
is conserved, as is the case in the frame we have defined above.) Finally, from Eqs. (34) we
find that, using the boosted momentum values (37), the constants $c_1$ and $c_2$ are given by

$$c_1 = \sqrt{\frac{1 + \sqrt{5}}{2}}, \qquad c_2 = \frac{1}{(1 + \sqrt{5})\,m},$$



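so that for the incident electron we have

$$u_1(\mathbf{p}) = \frac{1}{\sqrt{2(1 + \sqrt{5})}} \begin{pmatrix} 1 + \sqrt{5} \\ 0 \\ 0 \\ -2 \end{pmatrix}, \qquad u_2(\mathbf{p}) = \frac{1}{\sqrt{2(1 + \sqrt{5})}} \begin{pmatrix} 0 \\ 1 + \sqrt{5} \\ -2 \\ 0 \end{pmatrix},$$

and for the scattered electron we have

$$\bar{u}_1(\mathbf{p}') = \frac{1}{\sqrt{2(1 + \sqrt{5})}} \begin{pmatrix} 1 + \sqrt{5} & 0 & 0 & -2 \end{pmatrix}, \qquad \bar{u}_2(\mathbf{p}') = \frac{1}{\sqrt{2(1 + \sqrt{5})}} \begin{pmatrix} 0 & 1 + \sqrt{5} & -2 & 0 \end{pmatrix}.$$

We now find that the quantity $\bar{u}_s(\mathbf{p}')\gamma^0 u_r(\mathbf{p})$ is unity for no spin flip in the $z$-direction, but
vanishes for spin flip. The quantity $\bar{u}_s(\mathbf{p}')\gamma^3 u_r(\mathbf{p})$, on the other hand, vanishes for no spin
flip, but has the value $\pm 2$ for spin flip. Inserting these values into the expression (38), we
find that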



$$\frac{d\sigma}{d\Omega}(\text{no spin flip}) = \frac{9\alpha^2 Z^2}{320 m^2}, \qquad \frac{d\sigma}{d\Omega}(\text{spin flip}) = \frac{\alpha^2 Z^2}{20 m^2}. \tag{39}$$



We've struck another disaster! The coefficients 9/320 and 1/20 in (39) look nothing at all 
like the values 1/16 and 1/64 that we found in (36). But we have merely performed the 
same calculation in two different inertial frames! How on Earth could the value of the
cross-section, which can be directly related to the number of particles that would be expected
to be measured in an appropriately configured experiment, depend on an arbitrary choice
of theoretical viewpoint? For example, if we prepare a beam of incident electrons so that
they are completely polarized in the $z$-direction, and filter the scattered electrons so that
only those polarized in the $z$-direction are detected, then what would the cross-section be:
$\alpha^2Z^2/16m^2$ or $9\alpha^2Z^2/320m^2$? There cannot be two different answers!

One might, at first glance, suspect that some trivial mistake or oversight has been made. 
However, the calculations above can be checked; they do not contain any arithmetical errors. 
Failing this, one might then suspect that we have not taken into account the transformation of 
the solid angle differential $d\Omega'$ under a Lorentz boost. However, if one checks the derivation
of the first of the relations (31), then one finds that it holds true in any elastic scattering of 
a single particle from an "external" field — essentially, the other kinematical factors happen 
to "cancel out" in this special class of scattering events. 

There is, of course, a simple way to confirm or refute any suspicion one might have about 
the veracity of the results (39): one need simply combine them to find the unpolarized cross- 
section. Surely, any trivial errors made in obtaining the results (39) would (in all but the 
most contrived of situations) render the unpolarized combination similarly erroneous. But 
we are now flabbergasted to find that 

$$\frac{9}{320} + \frac{1}{20} = \frac{1}{16} + \frac{1}{64} = \frac{5}{64}.$$

Thus, even though we have obtained two sets of irreconcilably contradictory polarized cross- 
sections, we find that their unpolarized combinations agree completely (and agree with the 
standard Mott formula)! 
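
The boosted-frame arithmetic can be checked in the same spirit. The sketch below assumes NumPy, units with $m = 1$, cross-sections in units of $\alpha^2Z^2/m^2$, and the sign convention $\not{A} = A^0\gamma^0 - \mathbf{A}\cdot\boldsymbol{\gamma}$ for the vertex; the variable names are illustrative. It uses the boosted-frame spinors listed above and compares the result with the rest-frame values of Eq. (36).

```python
import numpy as np

s5 = np.sqrt(5.0)
N = 1.0 / np.sqrt(2.0 * (1.0 + s5))    # common normalization of the boosted spinors

# Incident spinors u_r(p) and scattered conjugate spinors u-bar_s(p')
u_in = [N * np.array([1.0 + s5, 0.0, 0.0, -2.0]),
        N * np.array([0.0, 1.0 + s5, -2.0, 0.0])]
ubar_out = [N * np.array([1.0 + s5, 0.0, 0.0, -2.0]),
            N * np.array([0.0, 1.0 + s5, -2.0, 0.0])]

# Dirac-Pauli gamma^0 and gamma^3
g0 = np.diag([1.0, 1.0, -1.0, -1.0])
g3 = np.array([[0.0, 0.0, 1.0, 0.0],
               [0.0, 0.0, 0.0, -1.0],
               [-1.0, 0.0, 0.0, 0.0],
               [0.0, 1.0, 0.0, 0.0]])

# A-slash for A^mu proportional to (3, 0, 0, -2)/sqrt(5) (assumed sign convention)
vertex = (3.0 * g0 + 2.0 * g3) / s5

# amp[r, s]: row = initial spin, column = final spin
amp = np.array([[ub @ vertex @ u for ub in ubar_out] for u in u_in])

boosted = amp ** 2 / 64.0              # (2m)^2 / |q|^4 = 1/64 for |q| = 4m
rest = np.array([[1 / 16, 1 / 64],
                 [1 / 64, 1 / 16]])    # Eq. (36)

print(boosted)                         # polarized values differ from Eq. (36)
print(0.5 * boosted.sum(), 0.5 * rest.sum())   # unpolarized values agree
```

The two sets of polarized values disagree entry by entry, while the spin-averaged sums coincide, which is exactly the puzzle posed in the text.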

What is going on? 

VI. Solution to the polarization "paradox" 

Let us now use the general discussion of Sec. IV to understand the polarization "paradox" 
of the previous section. The key flaw in the arguments presented above is the description 
"polarized in the z-direction". We have not specified whose z-direction is being used! The 
second calculation is simpler, in this regard, because the electron's final velocity is collinear 
with its initial velocity (i.e., it is in the same direction, but has the opposite sense). Thus, 
it is consistent for us to define "the $z$-direction" to be our $z$-axis, since all boosts to the
electron's frames of reference are collinear. We can, say, prepare an electron polarized in the 
z-direction, and measure only those scattered electrons polarized in the z-direction, without 
ambiguity. 

The first calculation, on the other hand, is more subtle. In using the standard expressions 
(33) and (35), we are (implicitly) applying one single Lorentz boost from our frame of 
reference to the initial electron's frame, and another single Lorentz boost from our frame 
to the scattered electron's frame. These two boosts, however, are not collinear; and so our
description of events is different from how the electron would describe matters. (By giving
the electron apparently human powers, we are of course imagining an observer traveling
along with the electron.) In effect, the electron's very rest frame is Thomas-rotated by the
scattering event, relative to us. For example, imagine that the electron state does not get
spin-flipped, as determined by the electron itself. From our point of view, however, the
direction of polarization of the electron has changed!

The lesson of this example is clear. If one has need to calculate relativistic polarized
cross-sections explicitly, and if the incident and scattered momenta of the particles involved
are not absolutely collinear (and in most practical experiments they are not), then one
must be extremely cautious about how one defines the spins or polarizations of the particles 
involved. In particular, kinematical and semi-classical arguments must be examined in fine 
detail, to ensure that the nonrelativistic concept of universality of orientation has not been 
inappropriately applied. 

Finally, we may use the expressions (25) and (26) to re-analyze these polarized cross- 
section calculations quantitatively. If we set $\mathbf{v}_1$ to be the initial electron velocity, namely,
$(-2/3, 0, 2/3)$, then it is straightforward to verify that a boost by $\mathbf{v}_2 = (12/13, 0, 0)$ results
in the correct final electron velocity of $\mathbf{v}_{12} = (2/3, 0, 2/3)$. If one sets $\mathbf{r}$ to be, say, $(0, 0, 1)$,
then, after some calculation, one finds that $\mathbf{r}' = (4/5, 0, 3/5)$. Thus, the electron's rest frame
has been Thomas-rotated by an angle $\theta_T = \arctan(4/3) \approx 53^\circ$ in the $z$-$x$ plane. If we now
list the matrix elements corresponding to the polarized Feynman amplitudes found in Sec. V
(rather than the cross-sections), then for the first and second frames of reference we found,
respectively,



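$$\frac{2m\alpha Z}{|\mathbf{q}|^2}\begin{pmatrix} 2 & -1 \\ 1 & 2 \end{pmatrix} \qquad \text{and} \qquad \frac{2m\alpha Z}{\sqrt{5}\,|\mathbf{q}|^2}\begin{pmatrix} 3 & -4 \\ 4 & 3 \end{pmatrix},$$

where the rows in these matrices represent the $z$-component of the initial spin, and the
columns the $z$-component of the final spin. To reconcile these Feynman amplitudes, we
need simply apply the Thomas rotation or its inverse to either the initial or the final spin
state in the first frame of reference. Remembering that spinors transform under rotations
by half-angles, and noting that $\cos(\theta_T/2) = 2/\sqrt{5}$ and $\sin(\theta_T/2) = 1/\sqrt{5}$, we finally obtain

$$\frac{2m\alpha Z}{|\mathbf{q}|^2}\begin{pmatrix} 2 & -1 \\ 1 & 2 \end{pmatrix}\begin{pmatrix} \cos\dfrac{\theta_T}{2} & -\sin\dfrac{\theta_T}{2} \\ \sin\dfrac{\theta_T}{2} & \cos\dfrac{\theta_T}{2} \end{pmatrix} = \frac{2m\alpha Z}{\sqrt{5}\,|\mathbf{q}|^2}\begin{pmatrix} 3 & -4 \\ 4 & 3 \end{pmatrix}.$$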
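
The half-angle reconciliation is a short matrix product that can be confirmed directly. This sketch assumes NumPy and takes the amplitude matrices as quoted in the text, with the common factor $2m\alpha Z/|\mathbf{q}|^2$ stripped; the variable names are illustrative.

```python
import numpy as np

# Rotated unit vector quoted in the text, and the Thomas angle it implies
r_prime = np.array([4 / 5, 0.0, 3 / 5])
theta_T = np.arctan2(r_prime[0], r_prime[2])    # rotation in the z-x plane

# Spinors rotate by half the angle
c, s = np.cos(theta_T / 2), np.sin(theta_T / 2)
R_half = np.array([[c, -s],
                   [s, c]])

# Amplitude matrices for the first and second frames (common factor removed)
M_first = np.array([[2.0, -1.0],
                    [1.0, 2.0]])
M_second = np.array([[3.0, -4.0],
                     [4.0, 3.0]]) / np.sqrt(5.0)

reconciled = M_first @ R_half

print(np.degrees(theta_T))       # about 53.13 degrees
print(reconciled)
print(reconciled ** 2 / 64.0)    # cross-sections in units of alpha^2 Z^2 / m^2
```

Since both amplitude matrices and the rotation matrix are combinations of the identity and a single antisymmetric matrix, they commute, so the order of the product does not matter here.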



VII. Conclusions 

We have shown how the Thomas rotation of relativistic mechanics can be introduced, and 
its "paradoxical" nature discussed, at quite an introductory level; that resolving such "para- 
doxes" is not overly difficult; and that a general expression for arbitrary Thomas rotations 
can be obtained without excessive effort. We have also shown how this general result con- 
nects up with standard textbook accounts of the infinitesimal Thomas precession. We have 
endeavored to show that the ramifications of such effects are deep, and fundamental, and 
that they may also be of immediate practical importance in the analysis and interpretation 
of relativistic polarized scattering experiments. 

In the interests of keeping this discussion at an introductory level, we have refrained from 
utilizing more advanced theoretical concepts to explain or analyze the Thomas rotation more 
elegantly or concisely. For example, group-theoretical methods are hinted at in the above 
derivations, but are not made explicit. (See, for example, Ref. 5 for a thorough treatment
in these terms.) Boosts can be viewed as simply "rotations" between space and time; and
since two rotations about different spatial axes do not, in general, commute, then one would 
(rightly) presume that two boosts in different directions do not commute either; this is 
another path to the Thomas rotation. Alternatively, one may make use of the concept of 
parallel transport — more familiar in the general theory of relativity, but equally applicable to 
boosts or accelerations in flat spacetime, to arrive at the Thomas rotation by yet another
path. We believe that all of these more abstract views of the Thomas rotation do, in
fact, augment, rather than detract from, the elementary nature and beauty of the effect as
described here.
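
The remark that boosts in different directions do not commute can be illustrated with explicit $4\times4$ matrices. This is a minimal sketch, assuming NumPy, units with $c = 1$, and the standard matrix form of a pure boost acting on components $(t, x, y, z)$; the function name `boost` is an illustrative choice.

```python
import numpy as np

def boost(v):
    """4x4 matrix of a pure boost with (non-zero) three-velocity v, c = 1."""
    v = np.asarray(v, dtype=float)
    v2 = v @ v
    g = 1.0 / np.sqrt(1.0 - v2)
    L = np.eye(4)
    L[0, 0] = g
    L[0, 1:] = L[1:, 0] = -g * v
    L[1:, 1:] += (g - 1.0) * np.outer(v, v) / v2
    return L

Bx = boost([0.6, 0.0, 0.0])
By = boost([0.0, 0.8, 0.0])

# Non-collinear boosts: the order matters
commutator = Bx @ By - By @ Bx
print(np.abs(commutator).max())      # non-zero

# A pure boost matrix is symmetric, but the product of two
# non-collinear boosts is not, so it must contain a rotation
product = Bx @ By
print(np.allclose(product, product.T))

# Collinear boosts commute and combine by the usual addition of velocities
print(np.allclose(boost([0.6, 0, 0]) @ boost([0.3, 0, 0]),
                  boost([0.9 / 1.18, 0, 0])))
```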